Proceedings of the Workshop on Innovative Corpus

نویسندگان

  • Nils Blomqvist
  • Gintarė Grigonytė
  • Simon Clematide
  • Andrius Utka
  • Martin Volk
چکیده

The task-oriented and format-driven development of corpus query systems has led to the creation of numerous corpus query languages (QLs) that vary strongly in expressiveness and syntax. This is a severe impediment for the interoperability of corpus analysis systems, which lack a common protocol. In this paper, we present KoralQuery, a JSON-LD based general corpus query protocol, aiming to be independent of particular QLs, tasks and corpus formats. In addition to describing the system of types and operations that KoralQuery is built on, we exemplify the representation of corpus queries in the serialized format and illustrate use cases in the KorAP project.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

بررسی تأثیر کارگاه آموزشی مهارت‌های تدریس بر کیفیت تدریس دستیاران

Studies show that residents play an important role in teaching medical students and as there is a large number of contact hours among them, they are assumed as the leaders of educational team. So developing teaching skills, being familiar with innovative teaching methods, knowing how to increase the educational efficacy, providing teaching objectives and educational spiral are the n...

متن کامل

Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language

In this paper, we describe how the TEITOK corpus tools helped to create a diachronic corpus for Old Spanish that contains both paleographic and linguistic information, which is easy to use for nonspecialists, and in which it is easy to perform manual improvements to automatically assigned POS tags and lemmas.

متن کامل

A Novel Drawing Method for Innovative Design of Karbandi Case Study: The Karbandis of Tabriz Historic Bazaar

Karbandi is a common structural and covering pattern for arched surfaces in Persian architecture, which is rooted in the precise methods of descriptive geometry. These methods, due to their strict geometry, do not have much flexibility and have been used only in specific fields in Iranian architecture. Therefore, the questions arise: what are the limitations and requirements of common drawing m...

متن کامل

Stereotyping and Bias in the Flickr30K Dataset

In: Proceedings of the Workshop on Multimodal Corpora: Computer vision and language processing (MMC-2016), pages 1–4. Workshop held: 24 May 2016, collocated with LREC 2016, Portorož, Slovenia. Proceedings available at: http://www.lrec-conf.org/proceedings/lrec2016/workshops/ LREC2016Workshop-MCC-2016-proceedings.pdf An untested assumption behind the crowdsourced descriptions of the images in th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015